INCLUSive: INtegrated Clustering, Upstream sequence retrieval and motif Sampling

نویسندگان

  • Gert Thijs
  • Yves Moreau
  • Frank De Smet
  • Janick Mathys
  • Magali Lescot
  • Stephane Rombauts
  • Pierre Rouzé
  • Bart De Moor
  • Kathleen Marchal
چکیده

INCLUSive allows automatic multistep analysis of microarray data (clustering and motif finding). The clustering algorithm (adaptive quality-based clustering) groups together genes with highly similar expression profiles. The upstream sequences of the genes belonging to a cluster are automatically retrieved from GenBank and can be fed directly into Motif Sampler, a Gibbs sampling algorithm that retrieves statistically over-represented motifs in sets of sequences, in this case upstream regions of co-expressed genes.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Integrating quality-based clustering of microarray data with Gibbs sampling for the discovery of regulatory motifs

In microarray experiments, genes exhibiting a similar expression profile are potentially coregulated. Clustering identifies such groups of coexpressed genes, whose upstream regions can then searched for putative regulatory elements. We present two algorithms and an interactive web-based user interface that integrate cluster analysis and motif finding for the analysis of microarray data. Startin...

متن کامل

INCLUSive: a web portal and service registry for microarray and regulatory sequence analysis

INCLUSive is a suite of algorithms and tools for the analysis of gene expression data and the discovery of cis-regulatory sequence elements. The tools allow normalization, filtering and clustering of microarray data, functional scoring of gene clusters, sequence retrieval, and detection of known and unknown regulatory elements using probabilistic sequence models and Gibbs sampling. All tools ar...

متن کامل

BioProspector: Discovering Conserved DNA Motifs in Upstream Regulatory Regions of Co-Expressed Genes

The development of genome sequencing and DNA microarray analysis of gene expression gives rise to the demand for data-mining tools. BioProspector, a C program using a Gibbs sampling strategy, examines the upstream region of genes in the same gene expression pattern group and looks for regulatory sequence motifs. BioProspector uses zero to third-order Markov background models whose parameters ar...

متن کامل

CSE 527 Lecture 4 , 10 / 08 / 03

Notes by Ana Kristine Torgerson" atorgers@u Case study, continued Sporulation summary What they did Measured mRNA expression levels of all 6200 yeast genes at 7 times points in a (loosely synchronized) sporulating yeast culture Plus some more standard tests and controls What they learned 3-10X increase in number of genes implicated in various subprocesses Several subsequently verified by direct...

متن کامل

Evolutionary Monte Carlo Methods for Clustering

The problem of clustering a group of observations according to some objective function (e.g., K -means clustering, variable selection) or a density (e.g., posterior from a Dirichlet process mixture model prior) can be cast in the framework of Monte Carlo sampling for cluster indicators. We propose a new method called the evolutionary Monte Carlo clustering (EMCC) algorithm, in which three new “...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Bioinformatics

دوره 18 2  شماره 

صفحات  -

تاریخ انتشار 2002